Structure-Aware Procedural Text Generation From an Image Sequence

نویسندگان

چکیده

It is an important activity for our society to create new value by combining materials. From daily cooking manufacturing industry, we often describe the way do it as a procedural text. As pointed some previous studies natural language understanding, one property of text its dependency context, which merging operations materials and can be represented graph or tree structure. This paper aims investigate impact explicitly introducing such structure on vision task generation from image sequence. To this end, propose (1) dataset, extends definition version (2) novel structure-aware model, learns context efficiently. Experimental results show that proposed method boost performance traditional versatile methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Procedural Text Generation from an Execution Video

In recent years, there has been a surge of interest in automatically describing images or videos in a natural language. These descriptions are useful for image/video search, etc. In this paper, we focus on procedure execution videos, in which a human makes or repairs something and propose a method for generating procedural texts from them. Since available video/text pairs are limited in size, t...

متن کامل

Table-to-text Generation by Structure-aware Seq2seq Learning

Table-to-text generation aims to generate a description for a factual table which can be viewed as a set of field-value records. To encode both the content and the structure of a table, we propose a novel structure-aware seq2seq architecture which consists of field-gating encoder and description generator with dual attention. In the encoding phase, we update the cell memory of the LSTM unit by ...

متن کامل

Text Structure - Aware Classification

Bag-of-words representations are used in many NLP applications, such as text classification and sentiment analysis. These representations ignore relations across different sentences in a text and disregard the underlying structure of documents. In this work, we present a method for text classification that takes into account document structure and only considers segments that contain informatio...

متن کامل

Groundtruth Image Generation from Electronic Text (Demonstration)

The problem of generating synthetic data for the training and evaluating of document analysis systems has been widely addressed in recent years. With the increased interest in processing multilingual sources, there is a tremendous need to be able to rapidly generate data in new languages and scripts, without the need to develop specialized systems. We have developed an approach that uses langua...

متن کامل

Automatic FDP/FAP generation from an image sequence

This paper presents an automatic FDP (Facial Definition Parameters) and FAP (Facial Animation Parameters) generation method from an image sequence that captures a frontal face. The proposed method is based on facial feature tracking without markers on a face. We present an efficient method to extract 2D facial features and to generate the FDP by applying 2D features to a generic face model. We ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2020.3043452